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ABSTRACT 

Questionnaires were administered to 361 participants 
of a standard 26-day Australian Outward Bound Program to examine 
systematic change and stabiltiy in multiple dimensions of 
self-concept, to test hypothesized effects of Outward Bound 
participation on self -concept , and to explore methodological i ssues 
in such studies. The participants (aged 16-31 , 75% male) had taken 1 
of 10 standard residential Outward Bound courses offered between 
November 1982 and May 1583 . Self-descr ipt ion questionnaires were 
completed 1 month before the start of the program, on the f irst day 
of the program, and on the last day of the program. Findings showed 
that participation in the program produced increases in the multiple 
dimensions of self-concept over the 26-day interval , demonstrating 
the program* s effectiveness. Counter explanations for the findings 
were examined with a variety of different approaches and did not 
appear to be viable. The psychometric properties of responses to the 
self description questionnaire ( reliability , dimensionality , and 
stability) , coupled with the systematic relationship between the size 
of shifts in the questionnaire scales and the scales ' a priori 
relevance to program goals , supported the validity of interpretations 
based upon the questionnaire and its use as a criterion measure in 
intervention studies. (Author/NEC) 
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Multidimensional Sel f -concept s: The Effect of 
Participation in an Outward Bound Program 

ABSTRACT 

The purposes of this study are to examine systematic change 
and stability in multiple dimensions of sei f -concept , to test 
hypothesised effects of participation in the Outward Bound Program 
on Bel f -concept , and to explore methodological issues in such 
studies. A totai of 2.7 groups <N^ S6l participants aged 16 to 31, 
/"Z'L male) participated in a 26 -day residential program. 
Far 1 1 c I pants completed the Self Description Questionnaire ill (SDQ 
III) of)e month bf::'fore tfie start of the program (time 1), on the 
fir^^t day of the program Iti/ne Z)^ cuui the last day of the 

program < 1 1 nie . Participation \\^ the program produced increases 
1 f) the multiple dimensions of sel f -concept over the 26-day 
inte/rval, demonstrating the program" s effectiveness- Counter 
t^:: p 1 r.^nat 1 ons for the findings we^Q^ e^ Alined witfi B variety of 
Jifff?refit approAoheS; and did not appear to be VtabiS- Tho 
psy^^u}met r 1 propi.^rties of r er^ponses K A^ tfie WDi} iil 

el I r.^h 1 1 1 t y , d 1 men^_^i ona 1 1 t y , anti stahility - c©^pl^<^ witfi tfie 
Syst fjinu t i c: r el ^^ti ons^ 5 p betv^fe^-n ttie %i ze. of shifts xw SDQ III 
sci^al es and the Sc^les^ a pr5cri relevance to program goals, 
support the validity of i nt<?rprelaii ons based uipc^o the SDQ HI and 
iK.^., c\\\ er iteriOD nieasure in i n ter vent i of^ stydies- 
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Mul ticiimpnsxonal Sel f -concepts : The ^-Iffect ai 
Par t i c. i pat i an in t he Out v^ar d Dt)Uf \d Pr ocjr am 

^? ) 1 1 ^ : ^y.) SI R 

L' nti J ncement of ^el f -cnnctfpt is widely vaKied as a desirable 
(;joal and has L^eef^ p(]<-sitt?d as afi i n t t?r vc?fi i nq proce^*^ that inay lead to 
( tiaru^es i!) othe^r (jutcomeps. ^(jwever^ systematic reviews of self - 
con^cepi re^i^^rch e/7)pha£i^^ the 1 ^acK of thi^orGtica] basi^ in most 
s t uiiJ 1 , the poor equality ui nieasuremen t instruments i^sed to ciS"r>esL; 
se 1 f - Of k; ep t , met hocio i 0(j x a 1 shor t r om i ngs^ and a cjofu-'ral 1 a( of 
cof.siv^tent findinq^. (e*c^-. burns, 1979; Shavelson, Hubner ?^ St.af^toru 
i^^/6: ISfKwelsc^n .^y Mar sfi , in press; Wel]<.. ^ Mar wel i , IV/^b; Wylie, 

1V79). belf concept, like? many other psychological constructs^ 
s\ifft:^rs in that ''everybody ICnowc, what it is" and researchers du not 
ft^tM c oiTipe 1 K'V'ii to provide any t hr'or t.^t i c a I d^.^f i fu 1 1 of)*.? iji wtiat tfiey 
are mea^umng. In an ottei«pt to remedy this sitLialiLin, :)fiave] vu^n t^t 
■ ^ I . po'.^i tei.i A mul i I faceted^ hierarchical mod«l of sel f 

viauept. Wh 1 I i? the nu ( 1 t i d i nu:^n s i t u< 1 i t y of s^^lf (.ofi(.o|.it Hot: 
t. m rvtM ScU 1 y arct^fjtecJ, the a'"^si.unp 1 1 or: fia^- reciMved ^.t r tmuj <;upport in 
c> r iH i-^ni r *vv i r e^"t.?.u t i ith 1 1 lited t^'/ t lu^ ! )h ^wt j i *.:-on model 

(Shavelson ^ flarsh^ in f>r»ss>^ snd from research by other 
investigators U?.g.<» BoersmA I* Ch^p*'*^*^; IV'V'*; Pov^e?!^ & flatter ty, 
l^^vU;- f in'-r.inr] Courtney , 1984; H..irter, 17^3?; Gc-ares & Scares, 
l^^Tj/^. Ill tht:ir r'eview '.>h vf-> I < .r an-i Mr^r^^ti trmtended tti^.^t tlt^» 
ri'.^l it M.-f'-^fiip l>etw*^e:) '-elf ' MfM. t't.^t and ot^fcir con^tr"ucL^ c^Minot Ij^:' 
aci» ' :H 1 1 y MMLier .stocni if t h i mo 1 t. r d i hum v. i ruw> I i t y is iqrfof «.Mi. 

Ton t. r u*:. t v<il i(iity stiuj;f><^, in '">elf c (.♦n c: t t (..in be c i a s '.y I f i t-»d as 

. t; ' o 5 ' I ' t w( 'T k„ or tu^t wtn ^' i- nelwor K f es^e^r^ch . VJi t h i n - netwf.^r k Gtiidi 
t^'^-t ir^e o^ i t t.'f )L of h /pi ,>r. ht.^'O / i fa^.oto.. of ^ .^^l T "Concept o.^.rj.^ 

- r 1 c ' ^ I , ^7 0L 1 a i ^M)yi <:u • rin*e<n i c. ) t hr vJii h [.) r nc knik ir es 1 i ^- e f t or 
'■•i^o vo .u ii! fiH ! i t i t r ^.u t mo J f i nu^t hod <"<n<.-^ 1 v v^es - I^et v^eeMo f M?t: worJ 
' t u . 1 M » - t t Oinp t t . ) ilr^iH .f V > t f n t ^ ^ t I u.M »r o I i ( c< I 1 . c^oos I ^t/gn t patte^rn of 
r »'Iat n.ui-.> bL'tween ^:>t?l f Cor*cept '3n<i ftxt^r?>al i r iter i.i '■;uc h a^.-^ ^jf tK-n 
•-{M^ ro:.'i..rt fnf .-^as-nr es , ah i 1 itv/a( ioevenuutt ifuiit ators, st^,*l i ■ r.oni.ept 
r-.'^^o-'-is Miftvrred l)v 'ooniti; /\nt. Others^ behavjor^ SncJ impact of 

^yS ^ ^ ".n.^ t 1 ■ • ; f d er V en t 1 ( )o ^ thc^t are £ie^^:gned t() enhances ^'^o { f ■ ( >n{ e^p t , 
I n\ r 1 :"• 1 f ! 1 y , r esf?ar (.; hc-.^f •> i f it or p(.'r t [K)th Mu-.^^^e-* t»f?ipf i a s* *s < 
.« ^ ^ ^ t u to d '■ '?TH 'f V- ) t: r t t f s a t ■ t: f»f f -i <vl c i f f->f i r y to - o i ( < ,i i i •/ 
rf'i.^ted t t.» ^3 .If t 1 ( o 1 <\r- facets of ^el f conc.fe>pt to which Miv-y ^tiri ^o^t. 
o>g,o I > rf^Iated^ af)d *SdUb st^ijat w^l 1 y I ej&o f-t?l^ted t^' other facet' <h 
f •- r or ;( or^t- F-or aaifj 1 , in c:^ n i o of f h*:» ^-i(.^lf 

■ , . 
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concept /academic achievement r el at i onshu p • Mar^fi <1984> reported 
tf^^t verbal and matht^mat ica 1 achievement's we . e subr^trant i al 1 y 
iorr elated with Verbal and Math ^e J i -( ofu_ ep t s , hut were nf*t^.r 1 y 
i.if H... t.^r r el at ed with fionacadenii t. area^ of self coru-ept- 

Stabi l^ity ^nc! fr^^3n(Je l ^^ ^el f -Coficepi. . St*i f c.ofu ept i n vt^^^t i y at ur^ 
f tu e an i fit er est i fig dilemma. T fieoret 1 1 al 1 y , arui f r ofri the perspective 
of measurement theory, it is cie*>irable that se i f -c(incept he relatively 
^■.tahle cjver time- yince c^t^ru^ral se 1 f Tonc. ep t. , avM) even particular 
^.Acets of sel f -c. OfK ep t , are tiaseci (in ^juc.h a wi cJe r aru^e of e^ per l en(. t? , 
evtMi cjrami.^tit^ life events and i n t er ven t i < >n^, may fiavi^ only c\ mocJest 
ef f t <jn se 1 f (jnc ep t - Gome r esec^r cht:^r^7 also argue tfiat it i *:> 
important to a pt^rson's mer^tai health that *:3€?1 f - conr ep t be relatively 
st entile. Nevt^r the 1 es*3 , mucf) cm the intt.>rc>st: in ^>elf (.c)nLf?pt i^ 
r.iirerted toward <; hariyt?s in self concept- as IS the case i ri most 
inter v^^/ntic^n stucJies that employ Sf^lf c:(3nc?.^f)t n)ea*jures- iiel f concept 
- annot l)e perfectly stable anc'- still be r-esj.H)nsi ve to dramat.ic life 
G-veiflts or system<^tir. i fi t c^r vi?n t i ons , arui fierein 1 i e^^ tfie diieinma. 

fiar^h^ Smith, Barnes ami [hitler- (1983> ami fled c ruAfujes in 
itiM I t I d 1 tnef 1 s I on a 1 sol f - t:onc;ept of p r e^uJu 1 esf en t chiicjren in ci ^tudy 
ttw/it h.^v> particular r*?levance to tfiivi d i 1 emau\ - first, their resf-?arch 
;dt-itti f i FHi mult i^-^ie ^-Ms-- f (Dn^.r-^pt t t^.At were (U^r i vtnj f r cjh) t fu_' 

-h ,-wf * 1 ( >o modf*] stu)w^.ni tftvU. t hev^t.^ ouiltipio K.u. ot wi*re 

' t t.'i'TK^ t 1 c. a 1 1 y riMateci to ^•)elf LcnH'epts f.M- r t'Mi by tea( i '.'^ -ukJ t(.} 

o^'riniv: h i vf:^t;ien t i nd i c, t ■:>r ^ ■ . ^}fMcnu1, t tu.>y foLinri tr)at t f)t» facet.^ 

' ^ ' M ' 1 + t t If t ( . L'p t r ^? r t.> i t 1 . ! V s t . <b I * ^ , * ^ m { for y > h Uicj (J w J ( ir ^ j . 

i^-i'^^i, t hr^y fouiui t ru\t changes m \ (.luu c'pt. thr.it Uiij ncc or wc^r e 

roli^A!)!^.^ -uici v'^- 1 ofn.it 1 \ , F|r1aliy^ ttuMr re^>v(lfs ^ - u{] c ^c"^^ . t mi t fiat 
changt^ti in ^^olt ^i<fUf'|>t wert^ (nu 1 t i d i mens i onal s»uc h tHat Minnas i/^a-.^re 
Specific t 1^ p.ar t 1 r_ o i d 1 fiif .-o.o ofi<,:. u f ^'^Mif.opt ratf)(-T tf^.^f, 

'f -f-T 1 1 1 ! f g r c,)--.s \ dimensions;. rhu^,, t?p,»y ^suqcjest eci , it is 
pss' ;t-lo fc)r 1 o t t-^r V i.M U I f HI t^^ hcJvo id^^r .d o £if fe^ct ^'O ^mo^ 

^- * i ■■ ' * ! ^ y-'\ '..n -.r>M '(,wh'^-o, / ^ if it i)<i'- ^ I s oh v. , t .^n f .i a i 

^^vf-r.^il sei f .o>ru t'pt . !h^^,o ro,uit^. ,5m^v off^^r ..n 
' oi't >t.)It.^ t c.^mpr oau *>f^ to hotl^ s;tje^.-. r» f the (IMefTima; ',,e 1 f cooif/pt is. 
r^'i'-tivoi^ -d.-%lilt^, hut f f*- M,,,.mm*-^ ^j,, t.^W.eplare ore roiiat.>]e 

and >p'*", 1 t I to p<.\rto,ul»u d i f!w ^n s i f )o ^ > < d soW iufitopt. f^jw^^v^^ , ti^esi^ 

•^■'"•^ ^^^or s wf»r>.^ not ahlr.^ t n) d Ofnon t r o tric:^ the s v '. d. t-ouU, i < torj 
^ ' * • 1 1 ^* ( rjrUKjt-. io fr)olti|>lo ^.u,>t^, of ',*..if r t,)ot ipt w^r e ^, i[ j t /.f r t j i 1 y 

' ^ .r r o 1 1 ^^jith r f,,jM,,^>^, If. othc^r Ccn-^t rvicts^ artd iKey Hoi s^,^ &,<afrofie 
^ ^ ^ soM iijfKnpt [^r , .ji p ^,.t by ( i ^MTi,o i . I n t er vof ^ t i : ^ 
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The study of changes in multiple dimensions of self-concept due 
to systematic interventions has important practical and theoretical 
implications, but is particularly plagued with methodologically weak 
studies, with studies based on poor measurement instruments, and 
with a paucity of consistent findings. While a systematic review of 
^uch research is beyond the scope of this article, reviews by Wylie 
(1979) and by Scheirer and Kraut (1979) are particularly relevant. 
Wylie reviewed research propos^ing to alter self -concept as a 
consequence of pB^ychotherapy . She concluded that: an "overview of 
all the substantively summarized research gives no support for the . 
belief that allegedly therapeutic or growth-producing group 
experiences affect the over--al 1 level of self-regard of volunteer 
'normal,' young adult participants" (1979, p. 642-643). Scheirer and 
Kraut (1979) reviewed the effects of academic interventions on self- 
concept and academic achievement; they also -Failed to find 
consistent pattern of results or many studies that produced any 
change in sel f -concept - Wylie cautioned that there are many reasons 
why we should be suspicious of systematic changes in self-concept 
produced by interventions (e.g., placebo effects, acquiescence to the 
e;i per 1 menter , etc.). However, when the trends are not positive 
despite these apparent biases, then the hypothesized changes in 
sel f --cQficep t are particularly tenuous- With regard to psychotherapy 
research she suggested two alternatives; to reject the popular^_and 
plausible notion that therapy shoi.dd improve sel f— concept , of to 
cesser t tfiat the notion is too intuitively plausible to abandon in 
the face of null findings and to pursue, methodol ugi cal 1 y and 
t ^)eor eti cal 1 y more sop^^l st i cat ed research, 

Why doesn t i ntervent i on r esearch demons tr atte- more syst emat i c 
c^^a^ges in sel f -concept 7 .Many probable reasons exist, but two are 
par 1 1 c.ul ar 1 y relevant to the |.)resent. m west i gat i on . First, /nost 
r e^sear ch uses ill-defined measures of overall sel f -^concept that 
ignore the mul t i di mensi onal i ty of the construct. If self ^concept is 
multidimensional, as asserted in this study, then some areas of 
sie 1 f -c one ep t will be logically related to tfie goals of an 
intervention w^1ile otfiers wi ) 1 not. Second, the siee of the likely 

f f ec t r el at 1 ve to the pr obab 1 e error is t yp i cal 1 y smal 1 . Tfii s will 
I'e c:r.xv.^e w^)en the? i n t er v'»?n 1 1 t?f ^ i or when tlie ru.imber of 

subjects in the study is small. I he primary purpo^^e of t^le present 
investigation is to examine tf)e effect uf a lujwerful intervention, 
the Outward E*ound program denser ib«d bti-low,. on mui't^pie di^n^ridions of 
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seif -concept for a large Bampie of par t i c i paf^t^ . 
The PMt. ward Bound Program. 

The first Outward Bound School wa^ developed by Kurt Hahn in 
W-ales in 1941 as a way of traiiiifiq your>q L^r i 1 1 s. h seanifpn to withstand 
the hazards of naval Warfare; the initial program wais desiqied to 
allow young men to realise thoir potential, and to develop ^a stronger 
character and a will to survive?. SinLc? 1941 more than 35 Outward 
Bound Schools have bi^en established in more than IS countries- The 
standard Uutv^ard Bound cour^_^e is a -day residential program far 
young men and woriH^n 17 to 29 year^; of age <or older v^ith special 
permission). The course is but!i physically and mentally very 
demandirujp Participants enroll \n the course for many reasons 
i TU~. i uci i fiy the enjoyment and challenge of the? outdoors activities, 
broadening their outlook on life, developing character, and enhancing 
1 e«^< jer <^,h I p potential. Some are sponsored by employers; son^e are sent 
by their parents; some are sponsored by social service organizations; 
and some sponsor themselves. Historically, Uutward Bound par t i c i pjants 
have been young men, but an incr(?asing ntimbc?r of young women are 
P ar t 1 c I 1 1 f uj in mi^ed^sf?)< or all-ft?m.4le groi.ipcs, 

Tar 1 1 c i paf) t work in groups of aboiit 12, sharifig must of the 
c^c. t 1 V 5 t 1 tr->s, nuifiy of w^l 1 ( h r€»(^uiro c cm)s i dc^r ab 1 e group or gan i a 1 1 on 
,4fw1 < ( >Mpf->r a t i c?? 1 runofuj thf^ i n d i v i ( iua 1 ' , . A^:? part of the f)r ogr r^jf^ each 
pi^^r v>c>n follc^ws a pf?rt-,onal phyvwCc^l i i t m^^^,s pr i}{:]r am wtuch jficliules 
r (>s^.: cuuritry riins, rc>rK tli/nbii^g, .u?(i t.ytj)i^r vigorous ac:l i v i 1. 1 ^T:, . 
f'.^r t it if^juit^> t.ravt^I \ip tc^ 3O0 km on f(.i-.i^ \n rniighi (notin t a i noii' . 
terr of^, or p._uid]i{)() tTuM r owrl craft if) Swiftly flut^ij^g r"2ver-^>, 
'rJh I 1 e tru^ strong enjphasi*; l*^ j f) f ^ y^. i r. a 1 , outdoor^ ac. t, i v i t i is a p.<r i 
r, f i'Mlward l.'ourxj, [.^hysiv^'U fiine'oV- is nut thi^ pr i m«tr y quf<i- 
i r i 1 M « j , f- 1 I. h a r d "^"^^ argues tfi^t t h** pr? v^-, i I <-.^<,: t i vi f i es nr an 

f t i^'. t 1 ve aHjnlium i or "tt)c» \u*r tu r c-* (.>c}n i :■: ,.u)d ur i fJer" ',t cUh1 fu'. own 

w^s^Kf '^'sses^, st r r:riytjis, (K)d r >f.ir c i - ^ .wvj tfuis find within tii'HS(-'2f 
^ ^i' wht.^r t^wi t f la 1 t. o {?ia"-.t^:r tl)n difficult c.u>d un f (\fn i I ' ar ( 1 >^ / , 

iuf^uiarivs overt^ e^Lern^A^ competi i:i on is deempbes i - r^j ami 
^ f'U i V. (jn ^'the w^li.rfwrl (. ( hmJ'^.* i t i ( m > uf Uu.e if^Jividuai iNith 

J :.T)-.f.lf, tryifMj to c-AifMfc.>ve qrr^at.e.'r perSCOal Q^oals, which for^ 

^ Lm.u..!*-. uf ( i )n«f.*et. i t. i < if) b*^twH»-'fi whe^r r.^ t.!>4-^ i f uj i v i dua i ^ st:«indQrds 
^ '^♦^ lif'Mi, wf'urt* hi^ w^M^t^^ th^^rn |u>" (f^'Mfiards, 19//, f)- 

The establ s shfliiHit anci fulfil iment of per^nal and group i^^^^ls zn 
<-it'Jt-:.jr f:.hy'-3i( al ac t, i v i t i , ih^' group experience.; ^^nd the 
< 'P; rtU'f.ity t .;.') ^'^^^ pejT 1 enC e and n)'*:^tr'^ ' ^ t ^ < ^ ' • . f > < i ^.^ i t n i ( )r v ^ .>r 
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important components o-f the program. In addition to increased 
phy«:,ical -fitness, goal*:! of the proqram include improvtuJ awarent?&o of 
self, sel f -conf i den.^e, initiative and sel f rel i anre , cooperation, 
awarencsi^ of others, taking r e^ponsi h i 1 1 1 y for self, and th«.' mastery 
of stressful \i tuat i on'^ (see Ricriards, 1977, for further discu=isicn 
of tfifc? .standard course and ttiB aims of the Outward Poind c oitrst:* 
examined in this study). 

Outward Pound Research i^tucjies, A 1 ar amoufit ui rei^earch froni 
all cjver the war 1 (i beef> izondueted to evaluate the impact of 

fjutward Bound programs and these have bet-n reviewed t)y Godfrey 
(19/4), fvichards ( 1977), and Shore (1977), Sfiore (19/7) noted that 
mo^-.t of th€? research is unpuhl.ished and methodologically flawed, 
leading him to argue that ''Ofie must tonclude, overall, tfiat trie 
r est>ar c:ri literature of Outward Bt:und is weak." None of the BO plus 
studies ri>viewed by Stibre u^ed a true eKper imental desxyfN anti few 
<^ y< ti>mat i al I V e-:<amine(j ttir^vits to the validity of \ n t er pr f.^t at i ons 
t^.^sed on pret Kst -post test Jifferencf- Scores- If) <Hie (^f t hf* nujst 
c ^r-.>tniiv ct)n->idered -.tudle^;, Smith, Gabriel, 'Jc:hott and Padir^ 
MV^r-, ^>.4"' r ec (jqfT 1 -ed the potential value- <.if a randomly assigned 
c.f'trol i.r nup t)u,t arciutui ttiat: " I n t er t er er u r> by the researchers into 
par t I cu^<^*^^^ Si»UM.tion and p 1 ac t-Moen t f h.m es v.ar" y m i mp i emenfc i ng a true 
experwDenta) desigr^ was impassible for ttiis -.tudy, finwever. 1 he 
^{,.,- . ..ion was math* insteau tn use tfu- interrupted time series ifesiyr^" 
■ ;'.4>. {\r^ tht.^ t}cA-n'. of thcMr i--tu(.iv Ihey fouoii evidence th/,t the 
.f.'^^r.un h<vi r\ pc.r-.>i t X vf » impcut on sell av^-ertwui (taktng personal 
r e-i'^ ^f^^^ ^ h 1 i 1 t V and a. 1. 1 vel v r on ( r or i m; nr»w or s^t r essf ui si t uat i c)nL>) 
\ \.^el f e'-'^t, likens \ 
':-(\\ i r u(us.^f>t IS tht^ most f requenl I y e%an.^ ned psychological 
^c^s.triut n the Outward Bouf^i literature, f<i(hc^rds <19/7) r r^vj owed 
t [ ;u(;r i^t a (. a 1 and ^MT»piri(<0 ba<.t:>- f^.^i^-the impart of Outward J<our;d on 
-,1 1 f -^nfu-ept afui , i^m \ that " Uu- ohv,er vat i of v. of rA) rr-any tens of 

tfiou^^-uul'-. student--, t: hr ouuh^ ujt the wn- 1 d wtu; h.w(,> (Ofppli-ted ttuj 

(;.;fwc.srd r.or.fui f.rwirses t I ear ! y 'upport the mfiupuff^ (lutward H<iund 
{■ the 1 nd 1 V 1 dua 1 " cM/.iepf nt <.elf. These <3b*- er v<>t i ons r»ave 

sn uaif.t.>d i fu..r r.^asi MC,' ^upport> espe( i a H y ^u n.< e the *fu ({ 1 V^^: r s from 
S^owifH^ body (^f re^.earch" (p,76). tslh i 1 ih<\\)y o\ iht.^ stn^lir-s m 
SSf.^. ■ I ^■.ss- revM-w also reported tf^.at pu^wTp<»fion in ^'^u t v^^, ir (j 
fw.)ur.o luu.^s to a more positive 1 ^ onc<^i; V^e concluded th^l the 
f 1 f (i I f^'r- wer*> ftu;<:^^ ine t h ( jd.i 1 no 1 / flawad^ and oft*^n ^on^usud. 

M-w**vf;^r , important f ) r nb ! ( *fTi'. in i \)^.: me a sur t Mn< -f 1 1 -eif (.nncei^t 

8 
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discussed earlier mean that it is difficult to disentangle 
measurement problems from the. impact of the program. Henct^ it i 
r>ot surprising that Shore found some methodoi ogi cal and empirical 
1 ficonsl stenc 1 es when reviewing fsel f ctnu. ep t sttidieis in the Outward 
BoufuJ literature. An important focur^ of tt\B present ^tudy *s to 
e^^amine psychometric proper tie^.s e^f a well devi^ioptni sei f concept 
inv>tr umetit ancJ the impact of the Uutwarci l^oiifid program on r espbnses 
to t^ll«ii I n^v.r umef it . 

METHOD 

MK\T^sur ement Instruments. 

In addition to t e bac i qr cund/demoqr ap^}i c i or mat; i On normally 

ccll.Mted by the Outward Dound fr oqr am, all participants were 
rec^uired to complete the Self Description Questionnaire (bDQ) III 
afKi tfie Rotter Locus of Control (LOO s( ale desc:ribed below, 

L^pQ III-. ^^^^^ ^'^i^^^ ^^nc> of a set of self report 

1 r)<. t r limcr^ t designed to fft^asure sel f -concepts of pr £?adoi escent s 

(SI)U), early-adolescents (SDQ II) afid 1 at e - adoi escent s and young 
c^iUilts (SDL? Ill)* The set of SDG i n^:, t r umen t s is ba^ed tipon the 
^^hr*'/^.*l \M>n model oy ^^df (onv.ept. (Shavelv^on, Hubner ^ Stanton, 1976), 
ttii.^ facets of r.el f - c cjnct?pt [ir opo^-^ed in that mculel , and earlier 
r <^^:.t-ar c h with the SDU instruments (see Mar B^rnf^^> Hocevar , m 

prt:'^-^; M.if sh i^)i\v^^l v-yijn , P ^4 ■ . Numerous exploratory and 
( < >f i f i r (Tiat r,>r y factor analyse?', uf r t'^>p'.M to the SO^ i n^::> t r runer ^ t 'i, 
fw-^vt' I (it-rU 1 f 1 t;ni tf)e Kic:tnrs ttiat etU, h i c> des:>iqned to measure and 
;>i:pp(>r"t. mn 1 t 1 (1 1 mnr ^ s I .>ria i 1 t >- cif ^>elf ri/OCf.^pt (e.g., Mar r4i , 

Cairias, ic^r n€-;> .^v Ticlni.^n, 1^8^; M.u -^f) S#OMhi.»il, 198/1; M^r Eh, Parker 
8^ rcArrH.>s, in rr e^;s; Marv.h, F^e 1 i ch Smith; 1985; riru^-,h, l-imitfi T< 
Bc^rnr-.>, iy\ press; iWiaveI-.nri & Marshy in press). (Jft)er researcri with 
^.itt' SIX) I f ^st. r umcnt.s has sf)t.:»Hfi Mud.: t,0 t.fu-» r i-- 1 i -?ib H i t y of each 
t^^i.tnr L^t.>ru.>r a 1 1 y \n th,.> O . 80 ' s and O.?0'^ whiU.» r: or r p i at i ojv.o 

MT.r.f.ij tfv> fcutijr^> are modest (fiieciian r 's . u cjeru^raily ^.J. ^'0 c/r 
1^'^'^^*^ ^''^ t^u? sei f ccmcept f cU t or-.^ are snt)<-, t r.^f 1 1 1 a 1 1 y corr^iMated with 

f fofMt/pt-^ ID *naiff)iMr| inf€?rrecf by teachers 3.r\6 by 

^ :(.;fM f a ,wit ntrnv s \ (1 .u ^^h, fc^r nes -'^^ Ho( e-var , in pr^s Mar-.^l 
'j'Wi^iK 19^4; Marsh, Smi tfi ^<arn^-., in (:ir'-.,^>; Marsf), Smxtfi, 

}:.^f nt'v., l^utler, 1 ' ) ; afxl, < ) t.fu- i ^ c one cp t s jn acacJen)!'. arear. 

^ • ' ' - t.- • t .u^ t. 1 a 1 ] •/ f ^.»r r tM ...M t with ai - (tj i ( . < h i ^ ^ m;i* - f . i i j Mh • ..^ t r>f s 
wh 1 n..narade?.c so 1 f - c c.x u. f-f) t >^ <.u e fU)t. iMvirsh, 1^^134; Mar^fi 
r'c^r f cr , Mar^;r), Parker Barr^.-.^s, m [•)ress; fiar-<^ri f)'Ni^-!l, 

IVK^l; Marsh, f\^rU>r & Smithy !985; Mv>r h, '^rith, Baroe^^ ?- Butle.r^ 
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l''?83)* These findings support thewa^lidity of interpretations based 
on the SDQ instruments. 

The SDQ III, the rationale fur its construction, its relation 
to the Shavelson model and the other^SDQ instrument^,' the wording of 
the items, its psychometric properties^ its relation to academic 
achievement and to self-concept inferred by significant others, *is 
summarised elsewhere (Marsh, Barries & Hocevar, in press! Marsh^^ 
O'Nieli, 1984). Each of the 13 SDQ III scales is represented by lO 
or 12.*items,, approximately half taf which are negatively worded, and 
^ubjecVs respond on an eight-point response scale where categories 
vary from " 1 -Def i ni tel y False" co i ni tel y true." The 13 

scales are described below, 

I, Math I have good mathematical skills/reasoning ability. ^ 
Verbal I have good verbal skills/reasoning ability. 

'xj General Academic I . am a good student m most school subjects. 

4. Problem Solving — I am yood at proble^m solving/creative 
t h 1 s\\ 1 ncj - 

1:., Physical Ability — I am good at sports and physical activities. 
6- Appearance I am physically at tract i ve/good i ooki ng . 

7. F\i?l at 1 or>«^ With Game Sex I ^l^^vf.^ goud 

^ 1 n t er ac t i ons/r el at i anshi with member of thi» same sex • 

13. Reli-^tiof^s With the Opposite Sex I have good 

1 f) t er ac 1 1 ons/r el at i onshi ps wxtfi members of the opposite sex. 

9. Relations Witti Parents 1 have* good i n t er ac 1 1 ons/re 1 t i on^^h i ps 
with my par en t s . 

10. Re I X Qi on/spi r i tual 1 1 y - - I am a religious/spiritual person* 

II, f-icjiie^-^t y • I am a?) hr>f)f»st , rtMi..^})li», t r Ltst wor t hy, person - 
Crnot. I of^al Stability I c^m an €'mat i iJfiM 1 I y stable person. 

\Z. General -Sel f I have sel f -respect , self-confidence, <ii3lf ^ 

rep t anf_e, positive ^el f -f eel i ng^i and a goad sel f ~ concept . 

Tb^ [^'y't^ter !?0n sc'cile, F<attur (1966; Rotter, Chance Phares, 
l^v;2; also see MacDonald, 1973) describes the Kotter LOC, its ' 
thi?oretxcal basis, and empirical studies based upon it. Internal- 
evternal locus of control i defined as the extent to uhich a person 
pi^rceives reinforcement as t.on 1 1 fige?nt upc>n tiis/her own t)eh..svior or 
irjdependent of it. The locus x is internal when the person perceives 
that events are contingent upon his/her awn behavior and relatively 
permanent personal character i ^.t i cr^, c^rtd €/;;ter:nal whert they are seen as 
contingent upon lucK, fate, the ccuitrol tjf power fid others, the 
environment, or some other characteristic not under his/her own 
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control . 

The Rotter LOC consists of 23 question pairs, using a forced-- 
choice format,, and six f i 1 1 er qu(?st i ons» Each pair of questions 
contains one internal statement and one external statement? the 
total score is the number of external responses. Rotter (1966; 
Rotter, Chance & Phare5,i^V972) presents internal consistency 
estimates of reliability of about O,70 from a number of different 
studies- Test-retest estimates of reliability are nearly as high, 
though they are more varied and generally based upon much smaller 
«-ample sizes. Factor analyses described by Rotter and by others 
suggest that there is one general factor which accdunts for most of 
the variance in the total ^ore^ though MacDonald <1973) suggests 
that the scale is not as factorial ly pure a^* it was onc& believed to 
be- Validity research witfj the Rotter LOC is summarized by Rotter 
^96&; Rotter, Chance ^ Phares, 1972) and by Lef court (1976^. 
• Subjects^ Drgign and FC9t§^^yC?§ / "-^^ 

Subjects are the 361 participants, aged 16 to 31 (median age = 
2i ) who completed one of lO standard Outward Bound courses offered 
between No vemb^r* 19S2 and May 1^83 at one of two dif forent 
Air^tralian sites. 967. of the participants were siaqle and 757- were 
male. Participants represented a wide range of socioeconomic 
btu: kyr ounds. About one-third the participants reported that they 

fi'3d t)^»e^n full -time students during the yi^ar prior to the course, and 
2S7- ^aid that they would be ^^tucJento the following year. A majority 
[SOX) had been employed nn full -time baSiv^ cii^ring the year prior 
tb the course. When presented with a i i *it of o(.; t i on«j of what titey 
hoped to get from Outward Bound, participant'^-' indicated: brc^aden 
outlfooK aruJ ^perience (82%), learfi new outcJoor sJ<iIIs CS9;i) , 
(ievelup char ac;.t er /per sc^^al 1 t y (S?^)^ i.hance'tn get fit (487.), have 
an enjoyable time (377.), develop ability to mix Socially (27r.), -^nd 
h(jl 1 day ( ISy.) , 

Ffif ollments fur t\)€? 10 different courses varied between 15 and 

57, Hnd within each course par t i r i p^uit s w£^?re divided. into groups of 
about 12. Of the 27 group^^ of participants, ! ^3 comprised all males, 
3 a) J female'3, and 9 were iDi^ed^sex groups m wtiict) between SiX and 
!5Q7. n-f the? members were femaif?- r<.H-h cukit ^^yi:^ was actiiaily cofiducted 
at one* ui two (jifferent Aixstralian sitt^r>. 

F'ar t ic i pants completed the? IU)0 III r^pprox i mcit e^l y one month 
before the st^rt c,>f thf> course; togethfi?r vAi t>> i uuf r eg i ^st r at i on / 
tnaterials ttini'e l) ^ agarn on tfie fir^a day of the? c nurse? (time* ?), 



Outward Bound 10 

and also on the last day of the 26~ctay^ course (time 3). The LOC was 
completed at times 2 and 3, and bat kground/demographi c information 
normally collected as part of the Outward Bound program was 
collected at time 2. At time 1 materials were mailed to participants 
and completed in their homes, while at tif.ies 2 end 3 materirils were 
completed at the Outward Bound site. Since the data collection was 
conducted as part of the program, there were very few missing 
r expenses ; 348 participants comp 1 eted the mater ials ail thr ee t i mes 
and none failed to complete the self ^report surveys for more than 
one of the testing points. 

Based upon previous research and the aims of the program 
described by Richards (1977), ,it was predicted th<^t participation in 
Outward Bound would result in more positive sel f -concepts and a more 
internal LOG. However, the SDQ III clearly differentiates among J3 
facets of self -concept which appeared- to have varying amounts of 
relevance to the program. Consequently, before the start of the 
d«:\ta collection, the Pr aqrani ,^0; r t?c t or wav? asi fd to judge the 
relevance oix each SDQ III scale to the aims of the program, and 

IV 

wfie' ner the relevance would vary for miMed-se:< and singlt?-seK 
qroup*^- These judgments were i ncnrfxir atecJ in subseqi^^n t analy^^-es. 
at 1 T^t 1 cal Anai y^i 

F^sychomptr X c Frof3erti.es of the Gl)Q III and the l.OC- In 
analyses of responses to the SDQ III, responses to negative items 
w#^'re reversed so that a re^spun^u- of '^S** always represented thi.^ most 
{)o^.^jtive re^pon^f-?, i nd i c; ai i r uj ^ higher ^l?^concept» Tht? mecJian 
r e?'spanse was then subst a tut i-ni for the few missing vaKtes that 
e-istod (less than 1/4 i if 17.) in complet^^d questionnaires. The 
(. cunmer CI al 1 y available SPSS program (Hui 1 S< Nie, 190!) was used to 
det er rn; ne "^.he? internal r ons i t eruy estimc^tes for ec^ch SDQ III scale 
soparately at time 1, time 2, and time 3, and also for difference 
scores represent 1 ncj the time 2 to time 3, change in responses. Then 
tf\L^ 10 <ur 12) items from earh scalf? were divided into 5 <or o) 
3 t pairs such"" that the first £wu items* in t fie «ic:ale were, assigned 
\ f hf^ f i»-st pair. tl>e next iwo items to the second pair, and so 
forth* Factor analyses were performed nn the sum c^f responses to 
t fie two items in each item pair a<-> i typically^ cinne in other SDQ 
rr^vearrf-t (':^ee Marsh .?v Q^Nitill':. 17^4). Tfie frM,tor .uialy^.es, usiny 
iterated co'mmunality estimates and an'oblique ratat:c3n* fallowing a 
^ di ^er normalization, were perfurmecJ with the commc^r l i a 1 i y avaiiat)lt- 
'T.PS?; prtigrcim (Nit?, et a]., i'^'^'.j) . Separate factisp Aoal yses werg* 
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performed on responses from time 1 <n=357) , time 2 <n=35S) , and time 
3 <n=355), and then a combined analysis was performed on all 1,070 
sets of responses, with responses by each subject at each time being 
considered to be a separate case. Factor scores (Nie, ex: al . , 1975) 
were derived from this combined factor analysis to represent thie 13 
SDQ III scales at each time separately so that all the scores were 
based upon a common metric aj^ so that mean differences between 
different times were not lost. Across all responses from time 1, 
time 2 and time 3, each of the 13 SDQ III factors was standardized 
to have mean ^50 and SD = 10. A similar factor analysis was also 
performed on the difference scores representing the change in time 2 
to time 3 responses to the itent pairs- Finally stability 
coefficients were coinputed for the time 1 to time 2, time I to time 
3, arid time 2 to time 3 intervals. 

Responses to the chotomousi y scored LGC items were scored so 
that " 1 reflected ar rA'ternal response and "O" an internal 
re^sponse? the total score represented the number of external 
responses. Internal consistency estimates were determined separately 
for LDC rt?sponses for time 2 and for time 3, and the stability 
coefficient was determined for t;he t^i fie 2 to time 3 interval. 

CrPQC^fD ImEg^t. The nature bf the Outward Boupd program, a 26 
clay residential program scheduled at specific times of the year in 
different parts uf the ciountry, meant that randomly assigned No 
Trec^tment and Placebo control grnup=^> \^or i? ru)t feasible. Thus, the 
eJt^v^icjn of the study is a critical feature. For purposes of this 
study, the design is characterized as a multiple interrupted time 
-f^riec^. Textbooks dealing witf) the. st^^tistiral analysis of time 
s€^r ic-n typically suggest testing at it?r^st 50 points in time, i r^stead 
<jf just three. Hc3wever, as emphasized by Cook and Campbell (1979), 
tht- advantages of i^smo eveir> a "short** tinif? series are important, even 
Xf 8o^-JenKins types of aruAlys€-?s are not appropriate. We choose to use 
the tt^rm time-series to emphasise the difference between the present 
design an(j the typical one-groitp pret f^v,t -post test design, though we 
recciKjni.-e that the "short*' time series IS al SO d i f f er ent -from longer 
t 1 series such as those di^^ciissed by Cooic and CampbelJ (1979). 

In a preliminary analysis of responses to the SDQ I 11^ a 13 
(sel f -concept factors) by 3 ^time^s) w x t h i n - ^uih i ch, t ANOVA wcii; 
conducted for the-? 348 subjects who had completed responses at 1 
three times with the cdmmerciaily available MANOVA routine from the 
GPSG package (Hull Nie, 1981). A 13 by-2 ANOVA was also cunciucted 
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to examine shifts in r^ponses during the time 2 to time 3 intervals 
Based upon the results of these analyses, more specific comparisons 
were made for particular self-concept factors, as described in the 
Results section. A paired t^test was also used to test for tne 
statistical significance of differences in responses to the LOG from 
time 2 to time 3. A more detailed examination of the prograr.> impact 
was made in further analyses of difference scores representing 
measures collected at time 2 and time 3, and i no^rporating the 
Program Director's judgments of the relevance of each of the 13 SDQ 
III factors to the program. 

Because of the large number of statistical tests that were 
conducted, significance testing was done at the O.Ol level, and also 
at the even more conservative 0.001 level. However, partially 
because of the large sample size, differences significant at O.Oi 
were usually significant at 0.001. Since each of the self-concept 
scores was scaled to have a standard deviation of 10 across all 
responses, mean differences divided by 10 provide a measure of the 
change in self -concept scores in standard deviation units. 

Resul ts and Di scussi on 
^'?Y?.hna}?t!:ir- i^COB^Cties of the iiDQ llj an^ the LOC Scale.^ 

The SDQ III is designed to measure 13 facets of sel f -concept ^ 
and four factor analyses were performed on responses from the 
instrument; separate analyses on the responses from each time and 
tfien a combined analysis across all three sets of responses. In each 
of the four factor analyses, the 13 SDQ III factors were clearly 
identifieni (see Table 1). Factor leadings fcr variables designed ko 
measure each ?acto^f target locidifjg^ were hiqh (median = 0.72), 
wfjile nor^target leadings v^re Small (median C. 02) . In each of the 
four factor analyses the sut^l lest target loadir>9 Was ^-^ubstan t i al 1 y 
larger trian the largest nofitarget loading. Cor r€?l at i uns arrionq the 
far.tC3rs were modest, ranging from -.07 to 0.39 (median = 0.10). These 
findings demonstrate that SDQ III fact(3rH identified in other r^ettings 
generalize to the responses by Outward Bound par tic i pantii, and that 
par t i c i parHs' sel f -concept s in different areas are quite distinct- 

Insert Tat)] e 1 About Here 
Coefficient alpha estimates of reliability were determined for 
each^ of the SDQ III scales for times 1, Zand S. The coefficient 
alphas (Table 2) were consistently in the 0. 30' s and 0.90'3 (median 
alpha ^ 0.90 at each time) for all but the HOfiesty factor wf)ich also 
had Inwe- reliability estimates in other SDQ III studies. f^iHhiiity 

14 
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coefficients from time 1 to time 2 were also in the O.SO's and 
0.90's (median = .87) except for Honesty (See Table 2), apd were 
only marginally less than the internal consistency estimates. 
Stability coefficients for time X to time 3 (median = 0.84), and for 
time 2 to time 3 (median = 0.83), where time 3 was after the 
intervention, were slightly lower; this may reflect the impact of 
the intervention. These findings demonstrate that each of the SDQ 
III scales is internally consistent, and stable over time. 

Insert Table 2 About Here 
The internal consistency estimates for the LOG scale are 0.71 
and 0.70 for times 2 and 3 (the LOG was not administered at time 1), 
and were similar to results in previous research with the instrument 
reported by Rotter. The stability coefficient from time 2 to time 3 
was 0.68 (see Table 2). 

Program Imgacti Change in Responses Before and After The Intervention 

Responses across ail 13 SDQ III factors differed substantially 

with the time of testing (F(2,696) = 136, p < O.OOl), but these 

differences due to time of testing varied significantly with the 

specific SDQ III factor (F(24,8352) = 15.2, p < 0.001). Mean 

responses to different scales were then examined separately since 

the ti me-by-scale interaction was significant. Inspection of th«?se 

freans (see Table 3) indicates that responses for the separate scales 

tienerally show Jittle chdng©. or a sfna-l I drop from time 1 to time 2, 

-then a sharp increase from time 2 to ^ime S. A time 1 to time 2 

dpclinf? IS statistically significant (p < 0.01) for 5 of the 13 

scales, while there is a sma-l i but si Qni f i cant increase for one 

scMe. The time 2 to time 3 increase ?s statistically significant 

and positive for all 15 scales, but the sire t>f the increase varies 

v#ith the scale. Inspection of the? means in Table 3 also indicates 

that there is a ^significant shift in response-s to tKe Rotter LQC 

scale which represtf^nts a chdnge hnv^&rds more internal responses- 

Insert Table 3 About Here 

The small decrease in some nf the sf*! f - rf>ncept t;calc^ from time 

1 to time 2 probably reflects the different settings in which the 

instruments were completed, and the proximity of a challenging or 

'i.tr es^^f ul experience. At time 1 the instrument was completed by 

par t 1 c: 1 pants in the nont hr eaten i ng environm€?nt of their own homes, 

about a month before the start of the course. At time 2 the 

instrument was completed in a new environment on the first day of 

trie program. (In a popular magazine account one participant 
r ,»• ■ -.7 

15 



Outward Bound 



14 



described his first day: "We were ail apprehensi ve. . . we knew nothing 
but <possibly> stretched tales about the limits to which we'd be 
pushed in activities we had not experienced before... we were all 
scared that we might be found wanting.'*) Consistent with this 
interpretation the largest time 1 to time 2 decline was for self- 
concept of Physical .Abilities which is probably the most stressful 
aspect of the program. Particularly for the Physical Abilities 
scale, the positive impact of the program was based upon the time 2 
to time 3 comparison, but there was no significant change between 
responses at times 1 and 3. The design of the study, the fact that 
time 2 and time 3 measures were admi n-i stered under similar 
circumstances and in the same physical setting while time i measures 
were not, and the interpretation of the time 1 to time 2 shift 
presented abover^all dictate that the time 2 to time 3 interval 
should be used to evaluate the program effects. Nevertheless, the 
shifts, particularly in the Physical Ability scale must be 
interpreted cautiously. 

An important feature of this stud/ is that the Outward Bound 
program was actually e5(perienced by 27 reasortably distinct groups of 
parti Ci paints wh^ch operated in different locations and at different 
time^. during the year. Thiii mtdtiple replication of the time-series 
design provides a test far i d i o^sync r at i c: , time-related occurrences 
that are a threat to the validity of conclusions ba?ied upon a single 
1 1 fTie-seri es (see Cook S( Campbell, 1979), In order to examine the 
generality of the program effects acro'-,s t fie groups, time 2 to time 
3 difference scores were deter mi oeti for responses for each of the 
Si'Li III fcuitors, and differences Isetween the 27 gi-oMps wej-e cOmp©re<i 
for each of the 13 scales. In only two uf the iZ oije way ANOVAs did 
differences between the 27 groups reach statistical significance at 
p < 0.05. and nona cf the differences was statistically significant 
at p < O.Ol. (One of the significant differences was for Relations 
Witf) the Opposite Se;-: and this was expected since one-third of the 
groups were mixed -sex and the rest were single-sex -- see further 
discussion below). These findings indicate that effects of the 
()ut,**ard EiQund Program were consistent across the groups and strongly 
'-:.upport the generality of the findings. 

Lack of a Np-Trjeat men t Control Groupi. Changes m SDO III 
responses were consistent with the Outward EUjund goals and points to 
the program's ability to enhance on sel f -concept . The relative lack 
of shift, or slight decline in self concepts over the time 1 to time 
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2 control interval, suggests that the time 



2 to time 3 increases were 



not due tj regression effects, spontaneous remission, and other 
threats to the validity of findings that are typically assessed in 
experimental designs with No Treatment control groups. In support of 
these conclusion we propose that a multiple repJ ' ation time-series 
design is nearly as effective as an experimental design with a No 
Treatment control. This position is most defensible in a study such 
as this where shifts in the criterion measure during the pretest- 
posttest interval are unlikely without any intervention. When the 
design of the study, the selection of subjects, or the nature of the 
criterion make it likely that there will be a systematic shift 
without an intervention, this position is more dubious — particularly 
when the direction of the likely shift is the same as the intended 
intervention effect. Even when the criterion measure is likely to 
change without any intervention, more sophisticated time-series 
design^, where there are a sufficient number of data points before the 
< r^terventlc>n to establish the trend in the shifts and/or the point at 
which trie intervention is introduced xs systematically varied, may be 
viable (see Cook Se Campbell, 1979; Smith, et al • , 1975). 
Fast Groug Eughgria and Program IfnpacHt. 

^ At the end of intensive group experiences, par 1 3 c i pant r> 

typically experience elation or good feelings that will be referred 
to fiert? as gost group euphor ia (PGE) . The r onc ern here i ?b not 
wriether a PGE exists, or wheth€?r it is good or bad. Rather, the 
criticr^l (question is whether other measure?^ are biased by a PGE so 
thai they do not validly re?fiect t^)e impact of the intervention 
par t i c. Ill ar I y mea^^ures has&d Upon Subjects* sel f -report 5- 
Conceptually, PGE bias. may be like a Hc^wtfiorne Effect, a placebo 
effec:^, a ruUo effect, an experimenter acquiescence effect in which 
^iibj£::^cts try to make the program lool good, or even a cognitive 
ciassofiance efrect where the time^ money and effort expe?nded by 
par t ic I pc^nts influences their ratings (these biases are not 
r. one eptual 1 y distinct and thore may be considerable overlap among 
tf^em). The possibility of PGE: bias is macie more plausible by the 
fc^ct that there were significant changes in all 13 SDD 111 scales, 
evnn tf^ough the scales differ substantially in their relevance to 
tho e; pr essed goals of tf^e pr ogram- It could be argued that tf^e 
^flgher ratings on all the BDQ III scales are a consequence of a 
single global bias or halO e-Tfect prc^UCed by the PGE rather than by 
the intervention. Thus, A PGE bias, if it exists^ represents a 
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source of invalidity in the self -concept responses* 

There are two separate issues related to a possible PGE bias: 
-first, whether the inferred short-term effects of the program and 
the sel f -concfept responses are valid or whether they are biased by 
PGE; and second, whether the program effects are maintained over 
time, based on the assumption that a PGE is short-term. It is 
important to realize that the short-term gains may be valid, even if 
they are not maintained. For example, in a weight-loss program, if 
participants lose significant amounts of weight during the course of 
the program, the weight loss is "real" even if they eventually 
regain the lost weight. <lf subjects' sel f -percept i ons of their 
"thinness" were used to evaluate the intervention, then changes in 
self reports might be duj£ tt> a PGE bias). Similarly, the physical 
endurance and stamina of virtually every Outward Bound participant 
was substantially increased by the end of the program, whether or 
not tfie new level of fitness was maintained. Whiie both issues are 
important, the focus of the present d i sc^u^^si on is on the validity 
issue. Rephrased in terms of intervention research, the question 
becomes whether the Outward Bound program actually affects multiple 
dimension-3 of sel f -concept , or whether ttie ^ipparent short-term gains 
are an artifact uf a PGE. In terms of sc^l f c(jru:ept research, the 
<]^te->tion IS one of the validity of the s;el f ^concept responses in 
1 f jter vent i on research. 



^.rovides little control f cir such an effect. Furthermore, a potential 
f Gr bias cr^anncpt be tested with a randomly assigned No Treatment 
control group , and so that desngn may also be weak for examining 
intervention effects where PGt bias 1 1> iikelv. Randomly assigned 
placebo control groups are sometimes used tcj e;:plare such counter- 
e^:p 1 anat 1 ans. Traditionally, a placebo is a neutral or null 
^i^timuliis given to the subject as if it were the active treatment, 
thus allowing the researcher to sep.u ate treatment effects from 
psycfiol ogi cal reactic^ns produced by the placebo. Placebo controls 
r^r e particularly effective when the actual intervention effects and 
svtbjects' psychological reactions to the i nter ven 1 1 on are 
c onceptt^al 1 y distinct, as may be the case in medical researrJu 
fiowever, placebo control groups are less viable in studies such as 
))ero where: a) the dependent variables, ^uch as sel f -contempt and 
IOC, are based upofi subjertf;' psyrfiol oy i c al reactions; and b> it is 



A PGE bias is an important threat to the validity of any 
c (jfK 1 u^.. 1 {jns based on nelf -report data.^ and the time -series design 
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unlikely that a placebo could capture the psycho! ogi c;»l experience 
of the Outward Bound Program and still not be related to the Outward 
Bound goals which are tested in this study (i.e., no "Outward Bound- 
like" intervention is likely to be neutral with respect to its 
effect on self-concept and LOG). However, additional tests, based 
upon a more detailed analysis of the time 2 to time 3 difference 
scores, can be used to examine the viability of a PGE bias. In each 
of these analyses it is assumed that if the change scores for 
I onceptually distinct SDQ II 2 scales are similar in magnitude and/or 
highly correlated then there is support for a PGE bias, but also 
that if differences for the various measures differ in size and/or 
are relatively independent then support for a PGE bias is tenuous. 

SDQ III Difference Scores^ In the first set of analyses of the 
SDQ III difference scores, their reliability and dimensionality were 
examined. The logic employed here is that if a PGE bias exists, 
reliable changes in responses from time 2 to time 3 will reflect 
primarily this one factor and will not be specific to different 
areas of self-concept <st?e Marsh, Smith, Butler & Barnes, 1983 for 
further discussion). First, difference scores for each of the 10 
(or 12) items representing the SDQ III scales were used to estimate 
the internal consistency of each mear^ scale difference. These 
reliability estimates, tfiaugh lower than those presented in Table 2, 
were substantial, ranging from 0.52 to 0.82 (Md = 0.67). Second, 
difference scores between time 2 and time 3 were determined for each 
Item pair as described earlier, and were factor analyzed. A total 
of 21 eigenvalues were greater than 1 . 0, and the f irst factor 
accounted for only 13*/. of the total variance. In a l3-f actor 
i.olutiDn, iiearly ail of the 13 SOU Hi factors could be interpreted, 
th'jLujh the solution was not nearly as clear as for the analyses 
mm ar i ed in Table 1. 

Finally, di f f er edces in the SDQ III factor scores were 
correlated with eacf) other and wjth differences in the Rotter LOG 
scale. The size of the correlations among the SDQ III factor 
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statistically significant. 

The analyses of difference scores show that time 2 to time 3 
changes in the SDQ III responses are reliable and multidimensional, 
and that the dimensions underlying the change scores are similar to 



clearly indicate that shifts in ratings between time 2 and time 3 
are not uni di mensi onai , and that shifts in self-concept are specific 
to particular dimensions of sel f -concept • However, the technical 
difficulties involved in the analysis and interpretation of 
difference scores dictate that these results must be interpreted 
cautiously <see Cronbach Z< Furby, 1970; Marsh, Smith, Barnes 5< 
Butler, 1983) • Nevertheless, the findings argue against the 
€r»Kistence of a strong uni di mensional halo effect sucfi as a PGE bias 

the cause of the time 2 to time 3 shifts. 

Jhe l^agni^tude c)f Change in Pitfgrent Scal^es Before the first 
data collection, the Director of the Outward Bound program was asked 
to iruiicate his perception of the relevance of each of the SDQ III 
scales to the goals of the pr ugram. While noFie was absolutely 
1 rr el eva4'it , four Religion, Mathematics, Academic, and Relations 
With Opposite Sex were indicated to t)e clearly less relevant- In 

triii:> e>ense, the less relevant SDQ III ^-^tale^ serve as a control for 
-3 PGE bias. If shifts in the less relevant goals are 'significant and 
^^pproach the size of the more relevant goals, then thpre may be 
support for a PGE bias, though other e:rpl ariat i or^^ would be viable- 

T 1 iTie? 2 to time 3 changes were ^tat i st i caJ 1 y significarit for ail 

SDQ 111 dimensions (Table 3), but thie magr ji tude of the 
(1 : f f er ef)Cf?s varied. This spt^^cul at i on was coDfirmed by the results of 
af) addxtiof^al ANOVA conducted on ret.pofmeB to the 13 SDQ III 
fcutors. Across all r^caies the- shift was -statistically ^significant 
( F(I,3Si) ^ 26K6, p < 0.001), but tfie size of the shift varied for 
the different sicales as indicated by the i i me- by -seal e interactiof> 
(F( 12, 4212) - 12.2, p < 0.001), The mean time 2 to time 3 difference 
^core for the four "less relevant" scales was less than half that for 
the mean of the other nine r.caies (see Table 3;, and the difference in 
c h<:^nges for the two sets of ^^caies was highly significant <t(351) - 
'^. '^y, p :: O.OOn. These findifigs demonstrate that the size of the 
s^ufts in sel f -concepts varied substantially for different scales, and 
that the differences were larger for thu-fe scale^- triat were a priori 
crujsen to be most relevant to tfte goals of the program- Tfiese 
findings offer further support agdin^t the er:istenre of a PGE bias and 



those ropreeeoting the SDGj XIl at times 2 and 3. The results 
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support for the program as having achievevi it aims. 

di?iedz§§>i §Ci^ Singie-sex Grgugs^ When evaluating the relevance 
of the different SDQ III scales, the Outward Bound Program Director 
indicated that Relations With the Oppor^ite Sex was often not 
relevafit to the Outward Bound program because group members were 
typically of the same sex, but that this scale would be relevant ^or 
mixed-sex groups. As indicated earlier, 9 of the 27 groups were 
mixed^sexj, consisting of between 31% and 50X females. Re^^ults from 
the single-^sex and mixed-sex groups were compared on time 2 to time 
3 differences for each of the 13 SDQ III scales. While the results 
failed to reach statistical significance for 12 of the 13 scales 
(all p^s > .2)1 mixed-sex groups experienced a significantly larger 
improvement in Relations With the Opposite Sex scale <t<35.1) = 4-5, 
p < 0.001). These findings indicate that changes in self -concept 
experienced in single-gSpc rind mixed-^sex groups did not vary except 
for the one scale that was chosen a priori to be most relevant to 
this group difference Relations With the Opposite Sex. Again 
these findings argue against the effect of a PGE bias. 

Summary of Discussion of A Possi^bl^e Post Groue Eughorla Bi.as^ A 
variety of analyses were conducted to test a counter-interpretation 
of the time 2 to time 3 shift asT a 5in9ley global halo effect such 
as ^ PGE bias. The shi-Fts wer^ clearly multidimensional, and shifts 
m different areas of self concept were nearly independent of each 
other and of shifts in LOC. The size of the shifts varied 
SI grn f I r ant i y , ar^d significantly larger shifts were observed in the 
GDQ III dimensions chosen a prion to be more relevant to the 
Outward Bound goals. F-inally, i f ig 1 e-'sex aruj mixed -sex groups 
differed only in terms of shifts in Relations With the Opposite Bex , 
bitt. ru)t in terms of the other SDQ III factors. These findings 
demofistrate that the time 2 to time 3 shifts were quite specific, 
and consistent with a priori pr e(i 1 c 1 1 orn^ based on program goals and 
group c^)ar acter 1 st i cs- It is still poH^sibie that such a bias had 
snmp effect in addition to the systematic effects produced by the 
Outward Bound program, or that specific PGE biases varied 
systematical 1 y wi th the crintent of each SDQ ^^cale. Nevertheless, the 
^^ystematic pattern of results argues ag^nnst the existence of a 
strong, global PGE bias- 

A PGE bi as represents a ser i ous threat to the validity of 
intervention studies that esnploy Self^^repOr^ crit^i^^ and to the 
validity of sel f concept response tn sucH reseairch, A randomly 
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assigned No Treatment control group offers no safeguard against such 
an effect, and placebo control groups may be inappropriate. Thus, 
the further exploration of alternative tests of PGE effects is 
important. A particularly powerful test of a PGE bias would be the 
inclusion of "control" criterion measures that are likely to be 
impacted by PGE biases but that are unrelated to the hypothesized 
effect of the intervention. While such approaches are rare in 
experimental research and intervention studies, they are frequently 
used in studies designed to test the construct validity of responses 
to psychological measures; tU& logic of the approach is similar to 
that used in mui t i trai t-mul t i method analyses. In intervention 
research, threats to the interpretation of intervention effects are 
potential sources of invalidity in the measurement instruments, and 
<^uppart for the validity of the measurement instruments provides 
support for the interpretation of the intervention effects. 
P iir 1 1 cui ar 1 y in true experimental studies, this inexorable link 
between measurement issues and the valid interpretation of the 
resists IS given insufficient recognition. 

Summar y Bj)^ I mp i IQ^tt^pns 

This study supports the Otitwar d E«ound fVogram as an effective 
i nler vefit i on for c^^a^gin^ multiple di merr.oX onsb of self-concept and, 
^>r:>r h3p<_^, locus of control. T^le findinqs, taTen together with 
r^rt^viou^s rese^arch, also ^>iApfH)r t the validity of interpretations 
ba<:,ed upon responses to tht^ SDQ III. and itr. usefulness as a 
f Titer ion measure far intervention studies. Psychometric properties 
i>i the SDQ * I I I reliability^ f^tor structure, and correlations 

art^ong cJifferent d i mef i^i^i qd^^ wt_^r e t o?i^^i*>tent at t i (nes 1, 2 and 

and were consistent with earlier research based upon student 
responses. In additiofi, stat^lity coefficients over the one-month 
tinie 1 to time 7 interval w*>r very hlg^^. Ki really, a powerful 
intervention i^pec i f i c: al 1 y designed to affect sel f concept was shown 
t^.^ significantly influence responses to the SDQ III, and the largest 
effects were ob':>erveci with the dimensions chosen a priori to be more 
rf?levant to the goals of the program. fhuS| s<?l f - c one: ep t was 

6.=>1 at 1 vel y stable, but changes did take place that were r^jiiable, 
^^pocific to particular dimensions of sel f -concept , and 
1.^ / s t ema t i cai 1 y r el a ted to the qoal s of t he i n t t?r vent i on . 

Theoretical issues of impurtarice in thi^^ study were the 
stability and systematic change in multiple dimensions of self- 
ccmrept, the ability of efff?ctive i n t er vc^^n t x mis to enhance «.>elf- 
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concept, and the methodology of such research. Good longitudinal 



research l,n" sel f -concept is rare (but see Bachman S< 0' Mai ley, 1977; 
Dusek li Flaherty, 1981? Maruyama, Rubin & Kingsbury, 1981), as are 
successful intes vention studies that use wei 1 -desi gned self-concept 
measures (see Rosenberg, 1979; Bcheirer & Kraut, 1979; Wiley, 1979), 
and studies that assess systematic changes in multiple dimensions of 
self -concept are even rarer. Hence, the findings of this study are 
Important in that they provided information about systematic change 
and stabi 1 iXy i n mul t.\ dimensi ortal ^ei f-concepts, and demonstrated 
iricit self-concept can be changed throug|i effective intervention. 
From this perspective a critical concern is to identify the 
characteristics that are important for effective intervention in 
self -concept research. 

Three characteristics of the present study were particularly 
important to the demonstration that self-concept can be enhanced* 
First, the nature of the intervention, the 26-day residential Outward 
Bound program, was particularly powerful. Second, the sample size 
was sufficiently large so that observed changes in self-concept 
could be differentiated from random error* (Also, the use of 
subjects as their own control strengthened the power of the 
statsstical tests). Third, the SDQ ill assessed multiple dimensions 
of self -concept that were clearly distinguishable and some of these 
facets were particularly relevant to the goals of the intervention. 
No matter how strong the intervention, and how powerful the 
statistical analysis, intervention effects are lil^ely to be 
negligible if they are assessed with poorly designed or irrelevant 
measures. Most research has relied on an ill^defined measure of 
overall or total sel f -concept , rather than specific facets of self- 
concept that accurately reflect the goals of the intervention. In 
f ut are research we strong 1 y recommend that these three 
characte^-istics, along with appropriate controls for PSE biases, be 
incorporated into the design. 
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Table 1 



Summary Statistics For 


Target 


Loadings, 


Nontarget Loadi 




and Factor Correlations in 


une rour 


racvQr f-tnaxyses 








I 


II 


III 


IV 


Target Loadings 


Hi ghest 


• 94 


.95 


.95 


.95 


(68 factor 


Lowest 


. 40 


.44 


.36 


.49 


loadings) 


Med i an 


• 72 


m 








7. > 0.30 


i 007. 




* 




Nontar get 


Hi gnest 


• 21 


.26 


.21 


.20 


Loadings {716 


Lowest 


14 


14 


-.14 


. 13 


factor 


ned 1 an 


. 02 


.02 


.02 




1 oadings) 


7. > 0-30 


or. 


07. 07. 


U 


Factor 


Hi ghest 


. 36 


. 38 


.38 


.39 


Corr el at i ons 


Lowest 


-.07 


-.04 


-.05 


.05 


(78 factur 


Medi an 


."10 


. 11 


. 1 1 


.11 


correl at i onsJ 


7- > 0-30 


47. 


57. 


47. 


5 



I = Time 1 responses (N===357) 
II = Time 2 responses <N= 358) 
III = Time 3 responses <N - 355) 
IV - Combined set of responses from times 1, 2 and 3 <N- 1,070) 
Note: Target loadings are the factor loadings of the 68 item-^pairs 
on the factor each is designed to measure, while all other factor 
loadings are Nontarget loadings. Factor Correlations are the factor 
pattern correlations among the 13 oblique correlations identified in 
each analysis. The "7. > 0.30" refers to the percentage of 
coefficients which are greater than 0.30. 
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TABLE 2 

Internal Consistency & Stability of SDQ III and LOG Scales 

Internal Consistency Stability Coefficients 



SDQ Scales 


Ti me 1 


Ti me 2 


Ti me 3 


T1/T2 


T1/T3 


T2/T3 


Mathematics 


.94 


.95 


.95 


.93 


. 92 


. 94 


Verbal 


.85 


.86 


.88 


.90 


.85 


.89 


Academi c 


.92 


.93 


.94 


.87 


. 86 


.89 


Proble»Ti Solving 


.84 


.83 


.86 


.85 


.80 


.83 


Physical Abilities 


.92 


.93 


.92 


.87 


.86 


. S3 


Appearance 


.88 


.89 


.88 


.84 


. 79 


.82 


Same Sex Peers 


.88 


.90 


.88 


.85 


. 77 


.81 


Opposite Sex Peers 


.91 


.92 


.92 


.88 


.84 


.89 


Parents . 


.89 


>Q9^ 


.90 - 


.88 


.87 


.89 


Rel i gi on 


.94 


.95 


. 95 


.94 


.92 


.93 


Honesty 


. 72 


. 77 


. 77 


. 77 


.68 


. 75 


Emot i onal 


. 90 


. 90 


.89 


.84 


. 76 


.81 


General 


. 94 


.94 


.93 


.86 


. 66 


. 73 


Median coe-f -f i ci ent 


.90 


.90 


.90 


. 87 


.84 


.83 


LQC Scale 














Total Score 


-/ 


. 71 


. 70 






. 68 



NOTL: Internal consistency coefficients are coefficient alpha 
estimate^, of reliability. Stability coefficients are the correlations 
between responses by the same individual at two points in time. 
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TABLE 3 

Means, Standard Deviations, and Mean Comparisons — - 

t-^tests of Significance for 
r?g3Dl SDlsJL fori Selected Pai^r3:wise Comparisons 



SDQ Scales 


Tl 


12 




T3 


Tl~ 


T2 


T2- 


T3 


Mathematics 


49.8 

(10.3) 


49. 
(10. 


8 
0) 


50.4 
(9.7) 


0. 


15 


3. 


63* 


Verbal 

/a 


50.0 
(10.0) 


49. 

(9. 


3 50. 8 
7) <10. 3) 


-3. 


20** 


5. 


88** 


CS 

Academi c 


50. 1 
(10.0) 


49. 
(9. 


3 
9) 


50.5 
( 10. 1 ) 


""3. 


10* 


4. 


66** 


Problem Solving 


50.4 
(10. 2) 


48. 

(9. 


9 50.7 
3) ( 10. 3) 


-5. 


60** 


5. 


61** 


Physical Abilities 


51.3 
(9.9) 


48. 
(10. 


0 
0) 


50.7 
(9.8) 


~1 1 . 


60«* 


7- 


95** 


Appearance 


49. 6 
( 10. 4) 


48. 

(9. 


3 

D } 


52.2 


-■4. 


30* * 


12. 


82** 


Same Sex Peers 


49. 5 
(10. 7) 


49. 
(9. 


3 
8) 


51 . 3 
(9. 4) 


- 0. 


8i 


6. 


36** 


a 

Oppositcr Se>; Peers 


49. 4 

(10.4) 


49. 
(9. 


6 
7) 


51.0 
(9.8) 


0. 


62 


5. 


36** 


Parents 

a 

Rel 1 gi on 


49. 8 

(10. 3) 


49. 
(9. 


7) 


51 . 3 
(9. 9)_ 




47 


9. 


01 ** 


49. 8 

(9. 9) 


49. 
(9. 


7) 


50.6 
(10.4) 


-Op 


98 


4. 


39** 


Honesty 


49. 5 
(9. 8) 


48. 
(10. 


7 

2) 


f51 . 9 
(9. 7) 




17 


8. 


39** 


Elmot i onal 


48. 6 
( 10. 7) 


49. 
(9. 


6 
7) 


51 . B 
< 9 . 3 ) 


3 . 


OS* 


7. 


10** 


General 
LOG Scale 

b 


48. 7 

( 10. 5) 


49. 

< 10. 


i 

1 ) 


^,'-> 'r^ 

( 9 . 0 ) 


1 . 


26 


8. 


59** 


Total Scare 




10. 

(3. 


5 
9) 


8.8 
(4.3) 






-9. 


43** 



* p < .01, p < .001 

a — The BDQ III scales chosen prion to be less relevant to Outward 
Bound aims by the program Director. 

b The LDC scale is scored in the eKternai direction so that higher 

s;cores represent a more external LOC- 



NOTE: For the SDQ scales, scores represent factor SCOres which have 
been standardized to have Mean - 50 anH <^r> - iO across responses 
from time 1, tin^e Z •nd -tifl* 3- Tnus, the mean difference between 
responses ^^im^ 2 ancf Cmo 3^ divided by iOj provides an estimate 
changes in self-concept in standard deviation units. Scores for 
"thtf Rotter LOC scale are raw responses. Responses to the 13 SDQ III 
^vcales were first analysed with o (time) x 13 < scale) ANOVA^ and 
those findings indicated that the effect of time was significant but 
ir^teracted significantly with the particular scale. This analysis 
was followed up with a series of pair-wise comparisons as presented 
in this tabl e. 
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